Prediction of missing values. Comparison of approaches

نویسندگان

چکیده

We compare four numerical methods for the prediction of missing values in different datasets [1]. These are 1) hierarchical maximum likelihood estimation (ℋ-MLE), and three machine learning (ML) methods, which include 2) k-nearest neighbors (kNN), 3) random forest, 4) Deep Neural Network (DNN). From ML best results (for considered datasets) were obtained by kNN method with (or seven) neighbors. On one dataset, MLE showed a smaller error than method, whereas, on another, was better. The requires lot linear algebra computations works fine almost all datasets. Its result can be improved taking threshold more accurate matrix arithmetics. To our surprise, well-known produces similar as ℋ-MLE worked much faster.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

comparison of catalytic activity of heteropoly compounds in the synthesis of bis(indolyl)alkanes.

heteropoly acids (hpa) and their salts have advantages as catalysts which make them both economically and environmentally attractive, strong br?nsted acidity, exhibiting fast reversible multi-electron redox transformations under rather mild conditions, very high solubility in polar solvents, fairly high thermal stability in the solid states, and efficient oxidizing ability, so that they are imp...

15 صفحه اول

a comparison of linguistic and pragmatic knowledge: a case of iranian learners of english

در این تحقیق دانش زبانشناسی و کاربردشناسی زبان آموزان ایرانی در سطح بالای متوسط مقایسه شد. 50 دانش آموز با سابقه آموزشی مشابه از شش آموزشگاه زبان مختلف در دو آزمون دانش زبانشناسی و آزمون دانش گفتار شناسی زبان انگلیسی شرکت کردند که سوالات هر دو تست توسط محقق تهیه شده بود. همچنین در این تحقیق کارایی کتابهای آموزشی زبان در فراهم آوردن درون داد کافی برای زبان آموزان ایرانی به عنوان هدف جانبی تحقیق ...

15 صفحه اول

A Comparison of Several Approaches to Missing Attribute Values in Data Mining

In the paper nine different approaches to missing attribute values are presented and compared. Ten input data files were used to investigate the performance of the nine methods to deal with missing attribute values. For testing both naive classification and new classification techniques of LERS (Learning from Examples based on Rough Sets) were used. The quality criterion was the average error r...

متن کامل

A comparison of traditional and rough set approaches to missing attribute values in data mining

Real-life data sets are often incomplete, i.e., some attribute values are missing. In this paper we compare traditional, frequently used methods of handling missing attribute values, which are based on preprocessing, with another class of methods dealing with missing attribute values in which rule induction is performed directly on incomplete data sets, i.e., handling missing attribute values a...

متن کامل

Mining Incomplete Data with Many Missing Attribute Values A Comparison of Probabilistic and Rough Set Approaches

In this paper, we study probabilistic and rough set approaches to missing attribute values. Probabilistic approaches are based on imputation, a missing attribute value is replaced either by the most probable known attribute value or by the most probable attribute value restricted to a concept. In this paper, in a rough set approach to missing attribute values we consider two interpretations of ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings in applied mathematics & mechanics

سال: 2021

ISSN: ['1617-7061']

DOI: https://doi.org/10.1002/pamm.202100043